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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AGAAATC AAT CTCAAAGTCA TTATAAAGAT 
25 1 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTC CACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
45 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAAT ATTCCA 
60 1 GCGAACTTCT G AAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
M 701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 

El 75 1 GAATGAGAAA GATC ACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
El 801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
U | 851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
* 901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
i J] 95 1 GAATTAGAAA GAAGCTAAAA G ATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1 101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTT TAGC 
1151 TACACAACTG ATAATAAATG CGTC ACACA A TTTATTAATG AATTTTTCTA 
I ! 1 20 1 CAATATACTC CCC AAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
Z 1 25 1 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
H 1 30 1 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
n j 1351 GGTTGAG ACC TCTGC AAAGC All 1 I IATTA TTTTGATC AC G AAAAC ATCT 
1401 ACGTCTTATG G AAATTGCTC CGATGG ATAT TCGAGGATCT CGTCGTCTCG 
1 45 1 CTGATTAGAT GA I 1 1 1 ICTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1 501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1 55 1 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGG ATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAG AAA ACTACTTTCC GTCCAATTAT G ACTTTCAAT AAGAAG ATTG 
1 70 1 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1 85 1 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
200 1 CTGCACAAAT TCTAAAGAGA AAG AATAACA TAGTTATCGA TTCGAAAAAC 
205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGG ACAAT ATCC AACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAG AAAACAT TAATTGTTGA AGCAAAGCAA 
220 1 AGAAATTATT TTAAGAAAGA TAACTTACTT C AACC AGTCA TTAATATTTG 
225 1 CCAATATAAT TAC ATTAACT TTAATGGGA A GTTTTATAAA CAAA CAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
240! CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC C A AGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TC AATTG ATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGGCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
fi 3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
r 305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
Si 3101 AAGTAC ATTT TCAAC AGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
5i 3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCA ATATG AT GCATAGTCGA 
ft) 3201 CTATTCTAAC TTATTTTGGA A AGTTA ATTT TCAATTTTTG TCTTATATAC 
M 3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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FIGURE 10 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RMQSQSHYKD 
51 LEDDGFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
1 0 1 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDE WNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKJAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKJDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LER.CFFYVTE QQKSYSKTYY YRKN1WDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDDEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNN1VIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKJCDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI . 
85 1 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
1 0 1 TTTGGATGAT ATAG AAAATT TACTTCCTAA TAC ATTCAAC AAGTATAGCA 
1 5 1 GCTCTTGTAG TGACAAGAA A GG ATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGT TCTAC TT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
50 1 GATCTCGATA C ATCAGACTT ACC AAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
M 701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
fc] 751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
CI 801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
HI 85 1 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
«fl 901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
01 951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
^ 1 00 1 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
^ 1 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAA ATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTA TAG 1111 1CATTT 
CI 1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
,fsi 1 25 1 TCAC ATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAG GAAACA 
? 1301 GCAGTC ATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
S 1351 AAG AAATGG A GCCG A AATCT T A ATC A AAAA GAATTGCGTC G ATATTGCAA 
™ 1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1 551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1 60 1 ATACAAACCT TGGTCAAAAT ATTGAGGAAG G AAAAG AAG A CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAG AAAT AA 
1 701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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FIGURE 12 



CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

! * ♦ ♦ ♦ 60 

GGCXnTTTCCGCTTTTCGGCTTTTCCGCAT^ 

a PQNPKTPKPL'KKKKLR *F R 

b PKTPKPQNPYKKRKNCGSLE- 

C PKPONPKTPIKKEKIE VV-K- 

AATAAAATATTATTCCCCCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 .* 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAA 

a NXILFPHKWRWILIW MI.. *KI 

b IKYYSRTNGDCYCFGC'Y.RKF- 
pjC * N I I P A 0 M E K D I O.L D D X B N L- 

£] TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 ♦ ----- - * ♦ 180 

ATG AAGGATT ATGT AAG TTGTTC AT ATCGTC G AG AAC ATC ACTGTTCTTTCCTACGTTTT 

a YFL IHSTS IAALVVTRKDAK 

b TS'YIQQV-QLL-'QERMQN- 
c LPNTTNKYSSS'CS DKKOCKT- 

f * ' CATTG AAATCTGGCTCG AAATCGCCTTC ATTG ACT ATTCC AAAGTTGC AAAAAC AATT AG 

s 181 * 240 

« s CT AACTTTAG ACCG AGCTTT AGCGGAAGT AACTGATAAGGTTTC AACCTTTTTCTTAATC 

teal 

jtnia HCNLARNRLHCLFQSCKNN 

i , b IEIWLEIAFI DYSKVAKTIR- 

f 3 * C LKSGSKSPSL TXPKLQKQLE- 

Hi 



AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 - - 300 

?'< | TCAAGATGAAGACCCTACGTTTAGAAATATTGCTAACAAAGAACTCTTTTAATCAAAATT 

a SSTSRHQI F I T I LSC EN " F 

b VLLLGCKSL • RFFLEK ISFK 

C FYrSDANLYNDSrLRKLVLK- 

AAAGCGCAGAGCAAAGAGTACAAATTCAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 - 360 

TTTCGCCTCTCGTTTCTC ATCTTT AACTTTCT AATG ATT AC AAATTT ATTTT AGTCC ATT 

a KAESKE* K.LKMY " CLNK I R 

b XRR. AKSHNCN ITNV * I K SGN 

C SGEQRVSI E T L L M r K * N Q V M - 

TG AGG ATTATTCTATT TT T T AC ATC ACTTCTT AACC ACC ATT ATGG AG AAAATT ACTT AA 

)61 42C 

ACTCCTAATAAGATAAAAAATCTACTGAAGAATTCCTCCTAATACCTCTTTT AATG AATT 

a CGLFYFLOHFLRS IHEKIT* 

b EDYSIF" ITS 'CALWRKLLN- 

c RIILFFRSLLKEHYCENYLI- 

TACTAAAAGGTAAACAGTTTGCATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

42 1 - - 480 

ATCATTTTCCATTTGTCAAACCTAATAAAGGCATCGCTTCTTACTACTCATATAATTTAA 

a Y-KVNSLDYFPSOOCCVY'i 

b TKR'TVW l I S LANNOEY I KF 

C LKCXOFGLFP'PTMMSILNS- 



w FIGURE 12 (cont.) w 

C A T ATG AG AATGAGTC AAAGG ATCTCG AT AC ATCAC ACTT ACC AAAGACAAACTCGCT AT 

481 — " 540 

CTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTCAATGGTTT^ 

a HMRMSQRISIHOTYQROTRY 
b ICECVKCSRY I RLTKOKL'A I 

YENESKOLDTSOLPKT'NSL 
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AAAACGC AAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTCCATTTACTATTCG 

— ♦ 600 

541 * * * * ouu 

TTTTCCCTTCTTTTTCAAACTATTACCTTCTC 

KTQEKVC "SN SRRTYC I YYS 
KRKKKFDNRTAEELIAFT IR- 
NARKSLI I EOOKNLLHLLFV- 

TATGGGTTTTATTACAATTCTTTTAGCTATCGACGGTGAACTCCCGAGTCTTC 

601 - " 660 

ATACCCAAAATAATCTTAACAAAATCCATAGCTCCCACTTGA 

YCFYYHCFRYRBCTP-ESCDM.- 
MGFITIVLG I OGELPSLETI - 
W V L L Q L F * VSTVNSRVLRQL- 



j»; tgaaAAAGCTCTTTACAACTGAACGAATCGCACTTCTCAAAGTTCTC 

Pi 661 * * 720 

U ACTTTTTCC AC AAATGTTG ACTTCCTT AC CGTC AAG ACTTTCAAGACT AC AC AT ACGGT A 

Pj 

■31 a CKSCLOLKESQFCKF-CVCH 

y\ b EKAVYNCRNRSSESSDVYAI- 

JIc KKLFTTEGIAVLK VLMCMPL- 

0' TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCT 

U 721 - *" -* * ♦ 780 

AT AAAA C A CTT AA TT AG A GTTT AT AG AAT AG AGTT AAATT ACCT ATCG AT ATCTTTGTTT 



YFVN'SQI SY LNLMDSYRNK 
ILCINLK YLISI'WIAIETN- 
FCELlSNILSQFNC'L-KOT- 



C C AAAT AAAC C ATGC AAGTTT AATCG AAT AT ACGTT AAATCCTTTGGG AC AAATGC AC AC 
GCTTTATTTGCTACCTTCAAATTACCTTATATCCAATTTACCAAACCCTGTTTACGTGTG 



Q a PNXPCK'FNC I YVKSFGTNAH 

n'jb QINHASLMEYTLNPLGQMHT 
Hi c K . T v g v • w N I R • I L W O K C T L - 



TG AATTT AT A TTGG ATTC TT AAAGC AT AG AT AC AC AG AATGCTTT AG AG ACTG ATTT ACC 
ACTT A"AA T AT AACC T AAC ,-ATTTCCT ATCT ATGTGTCTT ACC AAATCTCTG ACT AAATCG 



CIYiCFLKHRYTECFRDCFS 
b E F I LDS * S I DTQNALETD'LA 

c n l Y 'I L X A * I H R M L * R L I * L - 



TTACAACAGATTACCTGTTTTCATTACTCTTCCTCATCTCTTATATCTTTAAAAGAAGCA 
AATGTTGTC T AATGG AC AAAA CT AATG AG AACC ACT AG AG AAT AT AG AAATTTTCTTCGT 



LQQI TC.-DYSCSSLI SLKEA 
b Y N R L P V I i T L A H L t Y L " K K Q 

c 



TTO YLPCLLLtiSY IFKRSR 
GCCG AAA TCAAAAC AAC ACT AAAC AAAC AG ATTTCAAAATTTCTTCATTCTTC^ 
C CGC TTT AC TTTTC TTC TG ATTTC TTTC TCT AAAGTTTT AAAC AACT AAG AAG AC ATTGG 



C E M K R R L K K E IS KFVDSSVT 
b A X 1 C K t O RKR FONLL I L L ' P 

c 



RNEKXTKEROFK I CCFFCNR- 
GG AATT AAC AAC AAG AAT ATT AGC AACG AAAAAG AAG AAG ACCT ATCAC AATCCTG ATTC ^ ^ 
CCTT AATTGTTGTTCTT AT AATCGTTGC T't" I' T TCT TCTTCTCG ATAGTCTTAGGACTAAG 
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b ELTTRI -ATKKKKSYHNPOS- 

C k-0 0E Y "O? KRRRAITILIL- 



FIGURE 12 (cont.) 
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a 



TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAC 
AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATA 



LKISKIPGKRDTFIKIHI L * 

* rfqkfqvreihslxfi y y s 

KDFKNSR * - E R Y IH • N S Y I I V - 



TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTT^ 

_ — - — — — — — ♦ — — — — — — — — - — - - - — ♦ — — — • — — — — — — ♦ 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 



FFISQLLFSFILTIFFO'LE 
FSFHSCYFLLS •QYFLISWK 



c FHFTAV I F F Y LNN I FC LA G S- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 * * * * * * 1 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 



VKSIK'EKR-TEVT'LIHIH 
• KVSNKRSARLR • LSLFTF I 
KKYQIREALDCCNLAYSHS * 



a 
b 
c 

AGATCCACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

P 1261 - - — — - - — ' 1320 

g j TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

111 a RSTFIYPIRC'GNSSHPF-K 
^ b DRPSYIQY DDKETAV I R FKN 

c IDLHISNTM1RKQ0SSVLKI- 

j TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

H 13 2i - ^ 80 

Li ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGCCTTTAGAA 

2 a -cyED-IFRVKKWSRNLNQK 

r\ b SAMRTKFLESRNCAEILIKK- 

f 33 c V L C G L N F " SQEMEPKS * SKR- 

Li GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

r l381 - - 1440 

H! CTTAACGCAGCTATAACGTTTTCTTAGCTTGACATTTAGAAAGCAATTATTCATAATGGT 



ELRSYCKRIEL-IFR'-VLP 
b NCVDIAKESNSKSFVNKYYO- 
c IAS I L Q X NRT LNLS L 1 S I T N - 

ATCTTG ATTC ATTG AAG AG ATTG ACG AGGC AA.C TCC AC AG AAG A TC ATT AAAG AAAT AAA 

1441 • " 1*00 

T AG AACT AAC 7 AACTTCTCT AACTGCTC CGTTG ACCTGTCTTC T ACT AATTTCTTT ATTT 

a XLIOCRO.-RCNCTEOH-RMX 

b SCLIEEIDEATAOKIIKEI X- 

c LDCLKRL.TRQLHRRS L X K " 5- 

GTAACTTTTATTAATTAGACAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 " l ^ 60 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTACAACTT 



V T F I N ' R I N ITNIE I SDLQ 

b •LLLIRE'TKLLI'RSAI FN 

c 



1561 



NF Y-LENKLNY-YRDO RSSI 
TTG ACG AAAT AAAAGCTG AACT AAAGTT AG AC AAT AAAAAAT AC AAAC CTTGCTC AAAAT 
AACTGCTTT ATTTTCC ACTTG ATTTC AATCTGTTATTTTTT ATGTTTGC AACC AGTTT T A 



1620 



LTK'XLN - S • T " I K N T N L G 0 N 
b C R N X S C T K V R Q ' K I 0 T L V K I 

c 



DEE XAELKLDNKXYK PWSXY- 
ATTGAGC AAGG AAAAC AAG ACC ACTT AGC AAAAG AAAAAAT AACCC AAT AAAT AAAATC A ^ 
T AACTCCTTCCTTTTCTTCTGGTC AATCG TTTTCTT TTTT ATTCCG TT ATTT ATTTT ACT 



a lEECXEOOLAKEKIRO SXC 

b LRKSXKTS-OKXX GNK-NE 

c CGR XRRPVSKRKNXAINKMS 



FIGURE 12 (cont.) 



CTACACA^CTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGACGGGTT 

1681 * 1 7 <° 

C ATGTC TTC ACTTCTTTATTTTCT AAAT AAAAAAAGTTATTAAATAACTTTTCT 

a v'qkcrnkrfiffnnllkrgv 

b YRSEE1KDLFFSIIYCKEC F- 

c TEVKK'KIYF FO'FIEKRCF-' 

TTGGGG TTTTGGGCTTTTGGGG 

1741 1762 

AACCCCAAAACCCCAAAACCCC 

a LCFWCFG 

b WCrGVLG- 

c C V L G F W 
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FIGURE 13 



2 E VDONO A DNHG I H S ALKTC E E I K EAKTL Y SW I QKV I RCRNQSQ SH Y KDL 51 

I ■ !:.::!: : | | - : - I • ■ I I • ■ I • 1 - I _ 

19 ELELEXOENONOIQVRVK ... IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIXIrAOTNIVATPRDYNEEDFKVIARKEyr . STGLMIELIDKCLVELL 100 

63 DErWiITKALL. .. . E^AESDPEFICOLA\nTIRNEXYIRTTTNYI VAF . 107 

101 SSSDVSOR0KL0CFGF0LKGN0LAXTHLLTALSTQK0YFFQD^0>m^ 150 

C WHKNTOPF I EK YFNKAVLLPNDLLEVCEFAQVLY 1 14 4 



108 
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1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacx ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca- aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gaglacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactl gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatcn cagttcacct agttctcaat gcaataagtg 
1801 ttactta~gaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaalaaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac U U tlg ttt tattgcatag ccattatgaa atttaaatta ttatctattt 
234 1 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 



* 9 

FIGURE 20 

. MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLAVYIRNELYIRTTTN 
YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 
QDIRKELTFPJCCLQRCVRSKJSEFNEYQLGKYCTESQPvK^^ 
KJCRJ<ENLLTKLQAIKESEDKSKR£TGDIM^ 

APKJPNSTLESKYLTFKDLIKFCWSEPKERVYKJLGKXYPKTEEEYKAAFGDSASAP 
FT^PELAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLP^^ 
f SDTTHSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTK^ 

B AVKEVVEKTDEEKKDMELEQTEEGEFVKWEGIGKQYINSIELAIKIAVNKNLDEIKG 
£ HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFY1FSSPSSQ 
P CNKCYLEVDLPGDELRPSMQKLLQEKGKl^GGGTDFPYECIDEWTKNKTHVDNIVILSD 

E| MMIAEGYSDn^RGSSIVNSIIOCYKJDE\mPNIKUFAVDLEGYGKCLNLGDEFNEhn^ 

M 

M KJFGMSDSILKFISAKQGGANMVEVIKNFALQKJGQK 

0(1 

HI 



FIGURE 21 



1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaaglag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacngaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagncct tgcgaagcgl tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agaiagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtaiacmt aagtagaacg aatmaan taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag mgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactl aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatl ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattm agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa natttttca atcatttttt 
2821 aaaaaatcg 
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FKQRLLKKF>^LPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

5 TNTGVLKLFNT/VNASRVPKPYELYIDWRTV^ 

u 

U J IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

m lNIKJ<JLAMGGFQKYNAK^NRDKJLAVSSQSDDDTVIQFCAMmFVKELEVWKHSSTMN 

h 

H NFHIRSKSSKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 



KDLSrNVTQNMQFHSFLQRIlEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 
jj KFKJDNIILLRKEIQHLQAY1YIYIHIVN 



WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLR 



QSPKERVLKFIIVILQKLLPQEMFGSKKNKGKJIKNLNLLLSLPLNGYLPFDSLLKKL 



P 



9 » 

FIGURE 24 



Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

AKPLHWLKSVYWE1XRSPFYVTETTFQKHR 
hVUnan 1 8B I EWLVLGKRS W AXMCLSDFEKRKQI F AEF Z YWLYHSFI * || ^V^ppy^g j gg^^ 



Motif 1 

LFFYRK9V*SKL C SIGIRQHLKRVQLRDVSEAE^^ 
^YFRKDIWXLLCRPPI-TSMKMEAPEKIMEMHVRMDTQK-TTIjPP^ 

mFRHDTWNKLITPFIVEYFKTYLVEMVVCRN^^ 

tyyyrkhiwdvimkmsi-aj>lkketlaevqekevbewkks-ix>fapgklrlipkk--ttf 



human 
tezl 
EST2 
pl23 



«:5 

kS2 ^IA^cSeE^!^ 

3S RPlMTFRKKIVNSDRKTTKLTTHTKLLHSHIJllJtTLKH-RMFKDPPGPAVFHYDDVMK^ 

p r* . . ♦ * " 

□ ,,«uoupnR KKYFVR-^IKSCYDRIKODLKFRIVKR-KLKDPEPVIRKYATIHATS 

III BBPVCKWKQVGCPKLFFATMDIEKCYDSVNREKLSTPLKTTKLLSSDFWIMTAQILKRKM 



D 



FIGURE 26 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTGAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTCACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

a^accIaaggaacgagtcttgaj^^ 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCAGAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

A^^GATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCA^^ 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCA^AAAATTTTAGAATACCTAAGAAACAAAAGGCGGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

taatStctcWcagag 

CA^ACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

?ttcg™gatctcaatatttcttcaataccaatacaggtgtattgaagttatt^ 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

?S™atcaaatcaggatgttat^ 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
?GCTC?GA^GTTGATTTGGTGTATGACG 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

agaccaI2agcaagtgatcaatatcaaaaa 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGA 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATCAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

Arr^TGT^AACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCG^CT^ATGCAAATTCATCATGTTGTAAAGAACATTTCGGAATGTTATAAA 

5tSgSc?atcaattaatgttacgcaa^ 

cItcattgaaatgacagtcagcggttgtccaattacgaaatgtgatcctttaatc 
?gIgg™gattcaccatattgaatgga^^ 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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W FIGURE 27 V 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG1RQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 



FIGURE 28 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAG AAC AGGCTC 1 1 1 1 lCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 



FIGURE 29 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSN1CERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKMNVFEETVSKXRK 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYXKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LrNAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKlLS 

YSLKPNQVFAFLRS1LVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIK1SEIEWLVL 

GKRSNAIO^CLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINE^PJVIDTQKTTLPPAVIRLLPKKNTFRLITNLRIOIFLIKMGSNKXMLVSTNQT 

LRPVASILKHLFNEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVR1DIKSCYDRIKQDLMFR 

1VKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLK.EHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLHTVTJIO^AKJa^LNLSLRGFEKJWFSTSLEKTVWFENSNGIINNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN1 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 
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FIGURE 30 

^ccgamacraccmcncataagctaattg^ 

^nanagtgatcgataatamctatttutcggtcgttaccaagtataaggacaaaaagaacaacttccncccccuaagactttu^ 

incgcnactmaatcgtggtactgtmagctgcucttctagccaaccgcgtgmctaccccgtcattggatatagctctt^^ 

«»atgagactauinagancanacagtccgtgcata«cttaacatggagccttacactttagatgagtcac^^ 

aaaagWtaattamgcaaaatcatgtcct^^ 

ctccmaacgcggttiuttmctatmctanctcatgttgttccaaatatgtatcatctcguttaggcttt^ 
aataatctaaanagmcgcttataangatagtagtagaaagattggtg^^ 



CCTTAAATGATTATGTACAACTT6TTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 
CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgUtatatattmgtmgatttttttctettcg 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtanctaattgtgaaatatttacctgcaattactgmcaa^^ 
TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTC 
CCTTATATCTATACTTGAGTCAAAAAATTGGCAACnTTTGTTAGAAATgta^ 
agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAA^ 
CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGA^^ 
CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAG^TTTC 

CTG^GAA^TA^CA"nTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATC 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

Iotga^cJactggtatcacagagtacagttgtgcccaaacgtctcct^^ 

r CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 
^ CA^C^G^TCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

ATTATrTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaanaacaatcagATTTCAGAA^ 
CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGA^ 
AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTTTTTm 

aagTgat^acgaaatcgaactgtita^ 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtannaaagwtttmgcaaaaagctaatatmcagAACAA 
t TG^AGGATGGATM^TCAGAAAACTACTTTGCCTCCAGCA 

83 rTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgunaattmggtcatcaatgjacmacnctaatctattottag 

0 calATGGG™ 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgancctcattattaatmgcagGCGTAAGAAG 
TAmCTACGG^ 

Ar.rTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagm 
rrTrrCTrrTGAAAAAGTCGTGCAGTTAGTTTCTATGAAAACATC 

G^A^AC^ 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgttgaagGATTTC 
ACACAATTTn^TACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGA^ 

^^^^^ 

^ACA^AGCATGTCCTAAAATTGATGAAGCCrrATTr^ 

ATATGGGGAAAT CI i Hill l ACAAAATTCTAAGgtatactgtgtaactgaalaatagctgacaaamtcagATCGAGCCI l IUI 

atcJ^gcaca^^^ 

AAAGAATGTTCATAACGGgtgagtacttatmaactagaaaagtcattaattaaccttagATCTTTTGAATGTTATTGGAAOAAAA 
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ATGetacgtetcggtctcgagacttcagcaatangacacatcagGCTTTTTTGTCTTGGAATC 
TCAAATATC^^^ 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcatmcaamatlatatacatcctt 
tanacteMgtcnaaacaatananactaagtatagctgac^ 

atccnaucLaagaaagangacagtggngctgactactgcccacatgcccatUaacgggagtggttaaacatuaaagtaatacatgag^^ 



acwar-"""""^ ~™ • m ' ,p ' : ,c 1 'PcnctecaeatE icaet < ta lattttggtgai cgaattttggtaaaagcccc; ggttatcc rtggtggccg 



LtaagccattTng^ 
Latcttccaccga^ 

gagtatctccagcggatccngatgtcaataacnctamctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
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FIGURE 31 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

fsA Consensus 

□ 

O EST2 pep 

If] Euplotes pep 

$ Trans of cecrahymen 



FFYCTEISST V*IWFRHOT WN KLIT I FIVE V7K-TYLVEN 

FFYVTEQQKS YSKTYYYRKN IWDVI-MKMS IAD LKK ETLA--EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQrSEK NKQMKNNFYQ 



FFY.TE. . K. -S. . YYYRK. IW. 



-KL. 



.— F. .K 



.V. . 

MPCRG 
TTFNKK 




NVCRNHNSY TLSNFNH^** 

KEVE3EWKKSL GFAPG 

KIQLJSEENLE KVEEKLIPED SFQKYPQC 
F F..C 



ADEEBFTIYK ENHKNAIQP^QKILEjfeiJK FIPTSFTKIYS 

rVNSDRKTTK LTT^Ll^^fc — RKFK 

DKOKNIK U^II>lD^SLVFW#k£- ML-G -Q.<IGYSVFt> 



. K. .K LN.N. .L . .S-QL.L. 



.IG. .YF. 



[31 Consensus 



FKQRLU4 
YD-DVMH 
NK-QISH 



FN NVL 

She efvckwkqvg c 



-pfefQ^FMKFD VKSCYD 
ATMD IEKCYD 



IpA QFIEKWKNKG 

.KWK. .G . 



EttF .T. D . . -CYD 



40 
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44 

50 



79 
78 
92 

100 



129 
120 
130 
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FIGURE 32 



S- 1 : FFY VTE TTF QKN RLF FYR KS V WSK 
S-2: RQH LKR VQL RDV SEA EVR QHR EA 
S-3: ART FRR EKR AER UTS RVK ALF SVL NYE 

A-l: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FTP KPD GLR PIV NMD YW 
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Vector Genomic DNA Insert 



Vector 



A5 



5825 bp Sequenced 
-2 kb Hind III Fragment 



1 kb 



tezV 



RT Motifs 12 3(A) 4(B*) 5(C) 6(D) 
1/ \ \ 



Introns 



2 J 



1 cDNA 
3U5-2dcDNA 




7 8 9 10 11 



I 

Hind III Xca I 



12 13 14 15 



Original PCR 
3' RT-PCR 



Xca I 



■ 



RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Bdnc 
RT-PCR w/ M2-B15Bcnc 



RT-PCR w/ M2-B16 Bene C 



Hind 111 



500 bp 



FIGURE 34 



Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 



□ 

m 

01 

H 5 (c') 

Q D D Y L L I T 

H 3 '" ct 9 ct 9 atg gag tag tgg -5' 

f»S a a a a a a a a a 

m t t t t 

□ c c 

flj PolY_l 



FIGURE 35 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

ScTpl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLI LKLADDFL IIS 



QKVGIPQG 

caa aaa gtt ggt acc ccc cag gg <— Actual Genomic Sequence. 

□ t t c 
cgjagcctcg 
cajgf! acc aaa gga act cca taa gg > 

an acc aaa gga aCC cca cca ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
tp| tgg ttt cct taa ggt agt ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

•' fe KG I PSGSILS SFLCHFYM 

a 

G^J GAT TTG ATT GAT GAA TAC CTA TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA 
CT$ CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 

E W D L I DEY LS FTK K KG SVLLR 



GTA GTC gac gac tac etc etc ate acc 
CAT CAG ctg ctg atg gag gag tag tgg 

VVD DYLLIT 

< ctg ctg atg gag gag tag tgg 

a a aaaaa a a 
t t t t 
c c 

poly i 

gac gat ttc etc ttt ata aca <---Actual Genomic Sequence. 

D D F L F I T 



» „ \ 
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3' RT PCR Strategy 



| AXAXAAXAXAAAAAXX 
TTTTTTTTTTTTTTT 



3* 



1 . Synthesis of cDN A with Primer. 
mRNA 



1AXAAAAAAAAXAAXXX 
I TTTTTTTTTTTTTTT- 



&irst Round PCR Using Outside Primer ond Qq Mm* 




& Second Round PCR Using Inside Primer ond Q| Primer 



D 



Sequence 



Second Round PCR Products Using Inside Primer or Q| Primer. 




FIGURE 38 



6. 



-Size Selected Libraries from P. Nurese 
3 - 4 kb 
5-6kb 
7 -8kb 
11 - 12 kb 

-Libraries from J.A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
jf REP41ES Library 

a 



6. 
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FIGURE 39 



A5 B2 




p Hind III Digested Positive Genomic Clones 

m 
a 



» 

FIGURE 40 



5' RT PCR Strategy 



5 f 



mRNA 



l AAAAAAAAAAAAAAAA . 



ddA- 



1 . Synthesis of cDN A with Specific Downstream Primer. 



mRNA 

5* ^b^b^^^^^^^^^^^HI^HHMMHMHHII^^H^^BMBAAXJUUUUUUUUULXAA. 



|*2. Ligate Oligo with 5'-P and blocked 3* to cDNA using T4 RNA Ligase. 



□ 

Ul 



in 

^3. First Round PCR 

C3 

01. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p- Tezlp (429). WLYNSFIIPILQSFFYITES5DLRNRTVYFRKDIW ...(35)... 

S c. Est2p (366) . WLFRQLI PKI IQTFFYCTEI SSTVT- IVYFRHDTW . . . (35) . . . 

E a. pl23 (441). WI FEDLWSLI RCFFYVTEQQKSYSKTYYYRKN IW ...(35)... 



★ * * * ★ 



Motif 1 Motif 2 K 
p hh h K hF. h R 
S.p. Tezlp AVIRLLPKK- -NTFRLITN-LRKRF . ... (61) . . . 
S.c. Est2p SKMRI I PKKSNNEFRIIAIPCRGAD . . . (62) . . . 
E.a. pl23 GKLRL I PKK- -TTFRPIMTFNKKIV ...(61)... 
* * * * * * * 

Motif 3(A) AF 
b h hDh GY h 

? S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFR IVK . . . (89) . . . 
Q S.c. Est2p EL YFMKFDVKSCYDS I PRMECMR I LK 1 . . (75) . . . 
^ E.a. pl23 KLFFATMDIEKCYD3VNREKLSTFLK ...(107)... 

tfs ~ ***** * 

Bl Motif 4(B') 

M hPQG pP hh h 

K S.p. Tezlp YLQKVGI PQGSILS SFLCHFYMEDLIDEYLSF ...(6)... 

» S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF . . . (8) . . . 

□ E.a. pl23 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 
\,k ***** * * 

^; Y Motif 5(C) Motif 6(D) 

|fi h F DDhhh Gh h cK h 

0 S.p. Tezlp VLLRWDDFLFITVNKKDAKKFLNLSLRGFEKHMFSTSLEKTVINFENS .(205) 

m S c Est2p LI LKLADDFLI ISTDQQQVINIKKLAMGGFQKYWAKANRDKILAVSSQS . (173) 

" Ea pl23 LLMP.LTDDYLLITTQENNAVLFIEKLINVSRENGFKFNMKKLQTSFPLS .(209) 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 
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1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or uro4 marker. 



m 



a 
a 



M 



telomerase gene::M 
wild type telomerase gene 




wild type telomerase gene 



Wild Type (X2) 



2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 
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20 



1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccLgtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 ^gctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
01 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaattcctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 

6 1 ccaaaStgtat^ 720 
121 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 

801 gatac^tgcaaaLatttattagctatcat^ 880 
881 actattta^ttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 
1M TEHHTPK SRILRFLENQYV 

1019 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
21YLCTLNDY VQLVLRGSPASS 

1 07 KtAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 

4 |y snicerlr sdvqtsfsifl 60 

llsfficAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
6 | H s T V V G F D S K P D E G V Q F S S P 80 
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FIGURE 46 (cont.) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgat t tt tt tctat tcgggatagctaata tatgggcag 1272 

81 K C S 0 S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

87L I ANVVKQMF DES FERRR N L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtt tcaaagaga 1405 

107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CCA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHEDF RAM H V NGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 DLVST .FPNYL I SI LESK NWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 

149 L L L E I I G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

156 SDAMHYLLSKGSI FEALPND 175 

M 

166SAAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 

17C3N YL QISGIPL FKNNVFEETV 195 

172?|STCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

19f is KKRKRTIET S I T QNKSARK 215 

178L-GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

21§ E VSWNS.ISISRFSIFYRSSY 235 

P 

184|^AAG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 

19o£ k 

23feK KFKQD LYF N245 

190$1tTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 M L HSICDRNTVHMWLQWI FPR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFG LIN AFQVKQLHKVI PLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLI EQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AKRLHRISLS KVYNHYC PY I 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

326 DTHDDEKILSYSLKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

346 F LRS I LVRV F P K L I W G N Q R I 365 



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattt taccag AC CTC GAA ACT 233 
366 F E I I L K D LET 375 
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FIGURE 46 (cont.) 
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2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

NNVRnUi v ^ * 
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2-m ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 
^ 6TLP PAVIRLLPKKNTFR LIT 515 

2^6 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 
| S |6 N L R K R F L I K 

2^07 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 
P 5 M GSNKKMLVSTNQTLRPV 542 

2p68 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 
1*43 A S I LKHLI N E E S SG I P FN L E 562 

3§8 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 
Q 3 V Y M K L L T F K K D L L K H R M F G 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 
^ RKKYFVKXt-' 



582 



3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 
592 K S C Y D R I K Q D L M F R I V K K K L 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 
612KDPE FVIRKYATIHATSDR A 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 
632 TKNFV SEAFSYF 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 
644 DMV PFEKVVQLLSMKT65y 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 
660 S D T L F V D F V D Y W T K S S S E I F 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT. GTT AAG gtataccaattgttgaattgtaataaca 3532 
680 KMLKEHLSGHIVK 



FIGURE 46 (cont.) 

3533 cta« g ..»C ta9 ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

IGNSQY L»Q 



,594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 
709 ILS SFLCHFYMEDLI DEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 
729 ftKKKGSVL LRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 
749 V N K K D A K K F L N L S L R G 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 
765 FEKHN'FSTSLEKIV 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA : AGC AAG AAA 3900 
779 I NFENSNGI I NNTF FNESK K 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 
799 rmPFFGFSVNMRSLDTLLAC 818 

394* CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 
8111 P K I D . E A L F N S T S V E L T K H M G 838 

4oifl AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 
83p KSFFY KI L» R 

40<E AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 
84a S LASFAQVF I D I THNS K FN 

4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 
8<pjc C N I Y R L G Y S M C M R A Q A Y L K 888 

42 B AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 

bSrmkdifipqRMFITD 



42 W aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 
904 L L N V I G R K I W K K L A Si./ 

4340 GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 
918 EILGYTSRRFLSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 
936 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 P S F K Y H P C F E Q L I Y Q F Q S L T 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 
967 D L I K P L R P V L R Q V L F L H R R I 

4589 GCT GAT TAA tgtcattttcaatttactatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 
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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttactagtcttgattgacttgtct 4745 
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4 906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattctggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccacggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 
5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcctcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 
5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
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i 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg' val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser "arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 



P 100 110 

qlu lys ala glu arg leu thr ser arg val lys ala leu phe 
I** GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

m 

Q 120 

HJ ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 

AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG CGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 

GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT CTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 u n 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 

TTT GTC AAG CTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arq leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

210 

qly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

qln gly ile pro gin gly ser ile leu ser thr leu leu cys ser 

CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT* CGG 

250 260 ^ . . . 

arq asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 



280 



290 



arq gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 



310 



320 



phe val qln met pro. ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 



340 



350 



tvr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 , 

arq leu lys cys his ser leu phe leu asp leu gin val asn ser 

CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

M 460 470 

O 
□ 

Ul 
42 
til 



m 
u 

flj 



arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 



4 90 500 
f «, pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
H CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 



510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arq qly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCG ATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCC ACTACC GCG AGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCC AGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
40 1 ACCACCAGCG TGCGCAGCTA CCTGCCC AAC ACGGTGACCG ACGC ACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
50 1 TGGTTC ACCT GCTGGC ACGC TGCGCGNTNT TTGTGCTGGT GGNTCCC AGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCC AGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGG AG GGTGCGCTCT CTGGC ACGCG CCACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACC AAGCA CTTCCTCTAC 
1 05 1 TCCTCAGGCG AC AAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCCAGCC TG ACTGGCGT TCGGGAGGTT CGTGGAG AC A NTCTTTCTGG 
1151 TTCCAGGCCT TGG ATGCC AG GATTCCCCGC AGGTTGCCCC GCCTGCCCC A 
1 20 1 GCGNTACTGG C AAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1 25 1 CGCAGTGCCC CTACGGGGTG TTCCTC AAG A CGCACTGCCC GCTGCG AGCT 
1 30 1 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1 35 1 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
1 45 1 CTGCCTGCGC CGGCTGGTGC CCCC AGGCCT CTGGGGCTCC AGGC ACAACG 
1501 AACGCCGCTT CCTCAGG AAC ACC AAGAAGT TCATCTCCCT GGGGAAGC AT 
1 55 1 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 60 1 CGCTTGGCTG CGC AGGAGCC C AGGGGTTGG CTGTGTTCCG GCCGC AGAGC 
1 65 1 ACCGTCTGCG TGAGG AG ATC CTGGCC AAGT TCCTGCACTG GCTGATGAGT 
1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGG AGACC AC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 80 1 TGCAAAGCAT TGG AATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1 85 1 CTGTCGGAAG CAG AGGTC AG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1 90 1 GACGTCC AGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
195 1 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CG AGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCC A CAGGGCCTGG CGC ACCTTCG TGCTGCGTGT GCGGGCCC AG 
2151 GACCCGCCGC CTG AGCTGTA CTTTGTC AAG GTGGATGTGA CGGGCGCGTA 
220 1 CG ACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGC ATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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230 1 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
23 5 1 AG ACCTCCAG CCGTACATGC G AC AGTTCGT GGCTC ACCTG C AGG AN AAC A 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
245 1 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
255 1 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGAC ATGGAG 
260 1 AAC AAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TC AGG ACCCT GGTCCG AGGT GTCCCTG AGT ATGGCTGCGT GGTG AACTTG 
275 1 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
285 1 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCCA TCAGAGCCAG TCTC ACCTTC AACCGCGGCT TCAAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCAC A 
300 1 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGC ACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGT GTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCAC A TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCC AAG 
320 1 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
335 1 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
345 1 GGCCACCCGC CCACAGCCAG GCCGAGAGC A GACACCAGC A GCCCTGTC AC 
350 1 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGC ATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCC A 
3751 CTCCCC ACAT AGGAATAGTC CATCCCCAG A TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTC A AATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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FIGURE 53 




1 

met 



GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro ara ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg clu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly oro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 



phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg oro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 



leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG A^C GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 



ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



80 



90 



110 



120 



130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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140 150 
trp glv leu leu leu arg arg val gly asp asp val leu val his 
TGG GGC- CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAG CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu glv cys glu arg ala trp asn his ser val arg giu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala oro glu pro glu arg thr pro val gly gin giy ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his oro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 
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29 0 3C0 
giv ala 1 eu ser gly thr arg his ser his pro ser val gly arg 
GGT.GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cvs pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tvr ser ser gly asp lys glu gin leu arg pro ser phe leu 
p CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

£ 350 3t>0 

ai leu ser ser leu arg pro ser leu thr gly aia arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC C-CT CGG AGG CTC GTG 



370 

P glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 



380 390 
arg aro leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG ZAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg aia ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala glv val cys ala arg glu lys pro gin gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 
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440 450 
ala pre glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

lo U arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC.TTC GTG CGG 

470 480 
ala cvs leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arcr arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

M 500 510 

O leu gly ivs his ala lys leu ser leu gin glu leu thr trp lys 

jjj CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

| 520 

h 6 met ser val arg asp cys ala trp leu arg arg ser pro gly val 
H ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

0 530 540 

gly cys val pro ala ala glu his arg leu arg glu glu ile leu 

jj GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

□ 

flJ 550 

ala lys ohe leu his trp leu met ser val tyr val val glu leu 

GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile glv ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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ru 
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610 

cer clu ala glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 «° 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG OCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu ivs ara ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 



f 680 690 



glv a"! a ser val leu glv leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 



P 700 

H thr phe val leu arg val arg ala gin asp pro pro pro glu leu 

4 ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

5 710 7 20 
tvr phe val lvs val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

720 

gin a<=p arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr fyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 



FIGURE 53 (cont.) 



770 "80 
thr asp ieu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

300 , 810 

ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly iys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
crln cys aln gly ile pro gin gly ser ile ieu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser Ieu cys tyr gly asp met glu asn lys Ieu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGC- CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 

880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 
thr ala phe val aln met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 



FIGURE 53 (cont.) 

940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser tvr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly one lvs ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala glv met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGC- ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 



1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 



ft 
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1100 i 1 - 10 
arg thr ala aln thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu th- ala leu alu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe ivs thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

h* CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGA.GGGGCGGCCCACACCC 

□ 

5 AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 



1 GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 
CAGCACACCTGCCQTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 



R CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

u 

m CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

o 

fJ GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAJ\AAAAAAAAAAAAAA 

AAAAAAAA*A 
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